Abstract

By considering the "kinetic-energy" term of the minimum princi- 
ple for the Schrodinger equation as a measure of information, that 
minimum principle is viewed as a statistical estimation procedure, 
analogous to the manner in which Jaynes (Phys. Rev., 106, 620, 1957)
interpreted statistical mechanics. It is shown that the entropy
formula of Boltzmann and Jaynes obeys a property in common with the
quantum-mechanical kinetic energy, in which both quantities are in- 
terpreted as measures of correlation. It is shown that this property 
is shared by the key terms in the minimum principles of relativis- 
tic quantum mechanics and General Relativity. It is shown how this 
principle may be extended to non-Riemannian nonEuclidean spaces, 
which leads to novel field equations for the torsion. 

1 INTRODUCTION 



As Laplace famously observed, Newtonian mechanics is deterministic to an 
idealized intelligent being, but (as Laplace observed less-famously in the same 
passage) practical realities demand a probabilistic mechanics [1]. In the
century after Laplace's observation, two forms of probabilistic mechanics were
discovered: statistical mechanics, and quantum mechanics. While the for- 
mer maintained at a detailed level the determinism of Newtonian mechanics, 
acquiring its probabilistic nature only when describing macroscopic observa- 
tions of large ensembles of particles, quantum mechanics introduced a prob- 
abilistic nature at the most fundamental level. 

In a sharpening of Laplace's "principle of insufficient reason", Jaynes [2]
cast statistical mechanics into a novel form. Jaynes equated the entropy of
statistical mechanics 

$$ -\sum_i p_i \log(p_i) \tag{1} $$

to the entropy of Shannon's information theory, and equated the principle 
of maximum entropy with an information-theoretic inference law, Jaynes' 
principle, which asserts that the maximum entropy probability distribution 
is that distribution which is the least-biased estimate for such a distribution.
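
As a minimal numerical sketch of Jaynes' principle (not taken from the cited papers), the following Python snippet builds the maximum-entropy distribution over a handful of hypothetical energy levels with a prescribed mean energy. It assumes the standard result that the maximizer has the exponential form p_i proportional to exp(-beta E_i) and finds beta by bisection. The level values, the target mean and all function names are illustrative assumptions.

```python
import math

def entropy(p):
    """Shannon/Boltzmann entropy -sum p_i log p_i (terms with p_i = 0 contribute 0)."""
    return -sum(x * math.log(x) for x in p if x > 0.0)

def boltzmann(levels, beta):
    """Normalized distribution with p_i proportional to exp(-beta * E_i)."""
    weights = [math.exp(-beta * e) for e in levels]
    z = sum(weights)
    return [w / z for w in weights]

def mean_energy(levels, p):
    return sum(e * x for e, x in zip(levels, p))

def maxent_distribution(levels, target, lo=-50.0, hi=50.0, iters=200):
    """Bisect on beta until the mean energy matches `target`.

    The mean energy decreases monotonically as beta increases.
    """
    for _ in range(iters):
        mid = 0.5 * (lo + hi)
        if mean_energy(levels, boltzmann(levels, mid)) > target:
            lo = mid   # mean too high: a larger beta is needed
        else:
            hi = mid
    return boltzmann(levels, 0.5 * (lo + hi))

# Hypothetical energy levels and mean-energy constraint (illustrative values only).
LEVELS = [0.0, 1.0, 2.0, 3.0]
TARGET = 1.0
p_star = maxent_distribution(LEVELS, TARGET)

# A competing distribution with the same normalization and the same mean energy.
p_other = [0.5, 0.0, 0.5, 0.0]   # mean = 0.5 * 0 + 0.5 * 2 = 1.0
```

Both `p_star` and `p_other` satisfy the two constraints, but `entropy(p_star)` exceeds `entropy(p_other)`; in Jaynes' language, `p_other` asserts more than the constraints justify.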

The goal here is to generalize Jaynes' principle to the continuous prob- 
ability distributions of quantum mechanics, and to demonstrate how the 
extremum principles of quantum mechanics, the Schrodinger equation and 
the Dirac equation, may be viewed as statements for formulating least-biased 
estimates of those continuous probability distributions. My thesis [3] explored
the conceptual implications of this view. In this work, we focus on the 
surprising consequence of this perspective that the laws of gravitation as de- 
fined by General Relativity may also be viewed as a minimum-information 
principle, and show how this leads naturally to a set of field equations for 
non-Riemannian nonEuclidean geometries. 



1 Laplace, P.-S., Laplace's Theorie Analytique des Probabilites, 3rd ed., Paris: Courcier, 
1820. Reprinted in volume 7 of his Oeuvres Completes, Paris: Gauthier-Villars, 1886, see 
the Introduction, pp. vi-ix. 

2 Jaynes, E.T., Phys. Rev., 106, 620 (1957)

3 Van Drie, J.H., thesis, Calif. Inst. of Tech., 1978, unpublished



2 ENTROPY AS A MEASURE OF CORRELATION 




As pointed out in an earlier work [4], the Boltzmann / Shannon formula for the
entropy of a discrete distribution may be viewed as a measure of the
correlation between two distinct distributions. This was demonstrated by
showing that the entropy functional is non-decreasing as "correlation-destroying
transformations" are applied to the distribution. This perspective yields yet
another view of Jaynes' principle: the least-biased distribution is that which 
displays the least correlation between the variables consistent with the con- 
straints. Also, this perspective leads to yet another view of the Second Law 
of thermodynamics: correlations tend to spontaneously decay, and are highly 
unlikely to arise spontaneously in an isolated system. 

It is useful to recall the fundamental importance that the concept of cor- 
relation played in Maxwell's original derivation of the velocity distribution 
of the atoms of an ideal gas [5]. He placed two assumptions on the velocity
distribution $\Phi(v_x, v_y, v_z)\,d^3v$: (1) there should be no preferred orientation,
$\Phi(v_x, v_y, v_z)\,d^3v = \Phi(v)\,d^3v$, where $v^2 = v_x^2 + v_y^2 + v_z^2$, and (2) the
velocities along one direction should not be correlated with those along another
direction,

$$ \Phi(v_x, v_y, v_z)\,dv_x\,dv_y\,dv_z = \phi(v_x)\,dv_x\,\phi(v_y)\,dv_y\,\phi(v_z)\,dv_z \tag{2} $$

These assumptions lead to Maxwell's velocity distribution law, 

$$ \Phi(v_x, v_y, v_z)\,d^3v = \exp(-\alpha v^2)\,v^2\,dv\,d\Omega \tag{3} $$

where $\alpha$ is a positive constant (shown by Boltzmann to be proportional to $1/kT$), and $d\Omega$
is the element of surface integration in velocity space, $\sin(v_\theta)\,dv_\theta\,dv_\phi$. The
ability of a distribution over multiple variables to be expressed as the prod- 
uct of distributions over single variables is the hallmark of an uncorrelated 
distribution. 
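
The step from Maxwell's two assumptions to the Gaussian form can be sketched as follows. This is the standard textbook argument, written in the notation of equations (2) and (3); the constant $C$ and the separation constant $-2\alpha$ are labels introduced for this sketch only.

```latex
% Assumptions (1) and (2) combined: the product of one-dimensional factors
% depends only on v^2 = v_x^2 + v_y^2 + v_z^2.
\begin{align*}
\phi(v_x)\,\phi(v_y)\,\phi(v_z) &= \Phi(v), \qquad v^2 = v_x^2 + v_y^2 + v_z^2 \\
% Take the logarithm and differentiate with respect to v_x:
\frac{\phi'(v_x)}{\phi(v_x)} &= \frac{\Phi'(v)}{\Phi(v)}\,\frac{v_x}{v}
\;\;\Longrightarrow\;\;
\frac{1}{v_x}\frac{\phi'(v_x)}{\phi(v_x)} = \frac{1}{v}\frac{\Phi'(v)}{\Phi(v)} \\
% The same right-hand side arises for v_y and v_z, so each side is a constant:
\frac{1}{v_x}\frac{\phi'(v_x)}{\phi(v_x)} &= -2\alpha
\;\;\Longrightarrow\;\;
\phi(v_x) = C\,e^{-\alpha v_x^2} \\
\Phi(v) &= C^3\,e^{-\alpha\,(v_x^2 + v_y^2 + v_z^2)} = C^3\,e^{-\alpha v^2}
\end{align*}
```

Normalizability requires $\alpha > 0$, and writing $d^3v = v^2\,dv\,d\Omega$ gives the form of equation (3) up to the constant $C^3$.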



4 Van Drie, J.H., http://xxx.lanl.gov/abs/math-ph/0001024
5 Maxwell, J.C., Phil. Soc., 1860

3 KINETIC ENERGY AS A CORRELATION MEASURE

The term in the Hamiltonian associated with the Schrodinger equation,
$\langle \psi | \Delta | \psi \rangle$, is commonly called the "kinetic energy" (multiplied by a
suitable units-dependent constant), by analogy to the corresponding term in the
classical Hamiltonian, where $\Delta$ is the Laplacian operator,

$$ \Delta = -\frac{\partial^2}{\partial x^2} - \frac{\partial^2}{\partial y^2} - \frac{\partial^2}{\partial z^2} \tag{4} $$

It was originally suggested by this author in unpublished work [6], and
later by Sears, Dinur and Parr [7], that this expression represents an entropy
expression. This assertion rested on intuitive arguments, leaving open the
question "what mathematical property is common to both the kinetic energy
of quantum mechanics and the entropy of statistical mechanics?". The
answer which will be provided here is that both expressions are quantitative
measures of correlation, or the lack thereof; the idea that the quantum
mechanical kinetic energy in some instances measures correlation is an old
concept from molecular quantum mechanics [8].

Let us consider two spaces $M_1$ and $M_2$, of dimension $n_1$ and $n_2$ respectively,
and the cross-product space $M_1 \times M_2$ of dimension $n_1 + n_2$. Furthermore,
let us consider a representation of the group of rotations and translations
on $M_1$, $\gamma_1$, and a representation on $M_2$, $\gamma_2$. As in quantum mechanics,
let us assume that a metric exists which allows us to define a probability
distribution over $M_k$ from $\gamma_k$, $\rho_k = (\gamma_k, \gamma_k)$ for each point in $M_k$. One
can construct the product representation, $\gamma_1 \times \gamma_2$, as a representation over
$M_1 \times M_2$. Such a representation on $M_1 \times M_2$ is by definition uncorrelated
relative to the variables $M_1$ vis-a-vis $M_2$, since it can be written as the product
of a representation on $M_1$ and one on $M_2$.

Consider an operator on representations of $M_1$, $O_1$, and an operator on
representations of $M_2$, $O_2$, and the composition of these on $M_1 \times M_2$, $O_{1\times 2}$. We
assert that all such operators which obey the following relationship may be
considered as measures of correlation between the variables of $M_1$ and those
of $M_2$:

$$ O_{1\times 2} = O_1 + O_2 \tag{5} $$

6 Van Drie, J.H., Candidacy report, Calif. Inst. of Tech., 1975
7 Sears, S., Parr, R., Dinur, U., Isr. J. Chem., 19, 165 (1980)
8 Löwdin, P.O., Adv. Chem. Phys., 2, 207 (1959)

Denoting the expectation value of an operator $O_k$ over the space $M_k$
against the representation $\gamma_k$ by $\langle \gamma_k, O_k \gamma_k \rangle$, which equals

$$ \langle \gamma_k, O_k \gamma_k \rangle = \int_{M_k} (\gamma_k, O_k \gamma_k) \tag{6} $$

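we see that for operators which are considered measures of correlation against
$M_1$ and $M_2$, i.e. those operators obeying the relation (5),

$$ \langle \gamma_1 \times \gamma_2, O_{1\times 2}\, \gamma_1 \times \gamma_2 \rangle = \langle \gamma_1, O_1 \gamma_1 \rangle + \langle \gamma_2, O_2 \gamma_2 \rangle \tag{7} $$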
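
The step from (5) to (7) relies on the normalization of each factor. A short sketch, assuming that the expectation value over the product space factorizes into expectation values over $M_1$ and $M_2$, and that $O_1$ acts only on the first factor while $O_2$ acts only on the second:

```latex
\begin{align*}
\langle \gamma_1 \times \gamma_2,\; O_{1\times 2}\,(\gamma_1 \times \gamma_2) \rangle
 &= \langle \gamma_1 \times \gamma_2,\; (O_1 + O_2)(\gamma_1 \times \gamma_2) \rangle \\
 &= \langle \gamma_1, O_1 \gamma_1 \rangle\,\langle \gamma_2, \gamma_2 \rangle
  + \langle \gamma_1, \gamma_1 \rangle\,\langle \gamma_2, O_2 \gamma_2 \rangle \\
 &= \langle \gamma_1, O_1 \gamma_1 \rangle + \langle \gamma_2, O_2 \gamma_2 \rangle ,
 \qquad \text{since } \langle \gamma_k, \gamma_k \rangle = 1 .
\end{align*}
```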

Considering these expectation values as measures of correlation of their
corresponding representations, denoted $I[\gamma]$, this allows us to interpret the
above equation to say that, for uncorrelated representations, the measure of
correlation is additive:

$$ I[\gamma_1 \times \gamma_2] = I[\gamma_1] + I[\gamma_2] \tag{8} $$

This same relationship holds for the Boltzmann / Shannon entropy of
two discrete distributions, $P = \{p_i\}_{i=1}^{n}$ and $Q = \{q_j\}_{j=1}^{m}$, where the product
distribution is $PQ = \{p_i q_j\}_{i=1,\,j=1}^{n,\,m}$,

$$ S(PQ) = S(P) + S(Q) \tag{9} $$
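
The additivity in (9) is easy to check numerically. The sketch below uses two hypothetical distributions (the numbers are illustrative, not from the paper) and also compares against a correlated joint distribution with the same marginals, whose entropy falls below the sum.

```python
import math
from itertools import product

def entropy(dist):
    """Shannon/Boltzmann entropy -sum p log p (zero entries contribute 0)."""
    return -sum(x * math.log(x) for x in dist if x > 0.0)

# Two hypothetical discrete distributions (illustrative values only).
P = [0.5, 0.3, 0.2]
Q = [0.6, 0.4]

# Uncorrelated joint distribution: every joint probability is a product p_i * q_j.
PQ = [p * q for p, q in product(P, Q)]

# A correlated joint distribution with the same marginals P and Q
# (rows indexed by i, columns by j; row sums give P, column sums give Q).
CORRELATED = [0.5, 0.0,
              0.1, 0.2,
              0.0, 0.2]

s_p, s_q = entropy(P), entropy(Q)
s_product = entropy(PQ)
s_correlated = entropy(CORRELATED)
```

Here `s_product` equals `s_p + s_q` to rounding error, while `s_correlated` is strictly smaller, which is the sense in which the entropy registers correlation between the two sets of variables.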

Note that the Laplacian operator, the kinetic energy term of the minimum
principle for the Schrodinger equation, explicitly obeys the property (5)
for the $n$ translational variables $\{x_k\}_{k=1}^{n}$ of $n$-dimensional space $M$, since for
any $m$-dimensional subspace $\{x_k\}_{k=1}^{m}$, $M_1$, and its complement $\{x_k\}_{k=m+1}^{n}$, $M_2$,

$$ \Delta_M = \Delta_{M_1} + \Delta_{M_2}. $$
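
A finite-difference sketch of this additivity, for a product of two normalized Gaussians on a two-dimensional grid, is given below. The grid spacing, the Gaussian widths and the sign convention (the positive operator, minus the sum of second derivatives) are assumptions of this illustration.

```python
import math

H = 0.1                                   # grid spacing (illustrative choice)
XS = [-6.0 + H * i for i in range(121)]   # grid covering [-6, 6]

def normalized_gaussian(width):
    """Samples of exp(-x^2 / (2 width^2)), scaled so that sum(f^2) * H = 1."""
    f = [math.exp(-x * x / (2.0 * width * width)) for x in XS]
    norm = math.sqrt(sum(v * v for v in f) * H)
    return [v / norm for v in f]

def kinetic_1d(f):
    """<f, -f''> using the central second difference (zero outside the grid)."""
    n = len(f)
    total = 0.0
    for i in range(n):
        left = f[i - 1] if i > 0 else 0.0
        right = f[i + 1] if i < n - 1 else 0.0
        total += f[i] * (left - 2.0 * f[i] + right) / (H * H)
    return -total * H

def kinetic_2d(psi):
    """<psi, -(d2/dx2 + d2/dy2) psi> using the five-point stencil."""
    n, m = len(psi), len(psi[0])

    def at(i, j):
        return psi[i][j] if 0 <= i < n and 0 <= j < m else 0.0

    total = 0.0
    for i in range(n):
        for j in range(m):
            lap = (at(i - 1, j) + at(i + 1, j) + at(i, j - 1) + at(i, j + 1)
                   - 4.0 * psi[i][j]) / (H * H)
            total += psi[i][j] * lap
    return -total * H * H

g1 = normalized_gaussian(1.0)
g2 = normalized_gaussian(0.5)
psi = [[a * b for b in g2] for a in g1]   # uncorrelated product psi(x, y) = g1(x) g2(y)

k1, k2 = kinetic_1d(g1), kinetic_1d(g2)
k12 = kinetic_2d(psi)
```

For the product function, `k12` agrees with `k1 + k2` to rounding error, mirroring the additivity of the entropy for product distributions.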

Intriguingly, R. A. Fisher applied the term information to the expectation
value of the Laplacian, in an obscure and unelaborated reference [9].

9 Fisher, R.A., Stat. Methods and Sci. Inference, NY: Hafner, 1956, eqn. 155; in
the Introduction, however, he maintains that this has no relation to the concepts
in the Mathematical Theory of Communication, i.e. information theory

Even for 4-component spinor representations in relativistic spacetime,
where the relevant quantum mechanical equation is the Dirac equation and
the corresponding term in the minimum principle is the expectation value
of the operator $\not\!\partial$, the property (5) holds. For the flat spacetime metric
$g_{\mu\nu}$, a set of $4 \times 4$ matrices defined over the components of the spinors exists
which obeys the property

$$ \gamma_{(\mu}\gamma_{\nu)} = \tfrac{1}{2}(\gamma_\mu \gamma_\nu + \gamma_\nu \gamma_\mu) = g_{\mu\nu} \tag{10} $$

and the operator $\not\!\partial$ may be written

$$ \not\!\partial = \gamma^\mu \partial_\mu \tag{11} $$

As with the Laplacian, the property (5) is evident from the definition
of this operator; hence even in relativistic spacetime, the minimum principle
contains a term which we may call a measure of correlation.



4 THE GENERALIZED JAYNES' PRINCIPLE

Jaynes asserted that maximizing the entropy $-\sum_k p_k \log(p_k)$ over all
distributions $p_k$ subject to the constraints of a given energy $E = \sum_k E_k p_k$ and
normalization $\sum_k p_k = 1$ may be viewed as a statement that the distribution
$p_k$ is the least-biased distribution for $p_k$ subject to these constraints.



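$$ \delta_{p_k} \Big\{ -\sum_k p_k \log(p_k) - \lambda_1 \Big( \sum_k E_k p_k \Big) \tag{12} $$

$$ {} - \lambda_2 \Big( \sum_k p_k \Big) \Big\} = 0 \tag{13} $$

$$ \Rightarrow\; p_k = \alpha \exp(-\beta E_k) \tag{14} $$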

where $\lambda_1$ and $\lambda_2$ are Lagrange multipliers, and $\delta_{p_k}$ denotes varying over all
possible $\{p_k\}$. Viewing the Laplacian operator as a measure of correlation in
a representation $\gamma$, we assert that the minimum principle for the Schrodinger
equation may be viewed as the statement that $\gamma$ represents the least-biased
representation subject to the constraints of normalization and in the presence
of a potential $V(x)$:

$$ \delta_\gamma \Big\{ \langle \gamma, \Delta \gamma \rangle - \lambda_1 \int V(x)\,(\gamma, \gamma) - \lambda_2 \int (\gamma, \gamma) \Big\} = 0 \tag{15} $$

10 Feynman R.P., Quantum Electrodynamics, London: Benjamin/Cummings, 1962

One of the many conceptual implications of this view is that it allows 
us to understand the physical basis for why the electron does not collapse 
onto the nucleus of an atom: the tendency to minimize its potential energy 
by withdrawing into the nucleus is counterbalanced by the tendency of its 
distribution to resist achieving such a highly-correlated state. 
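
The balance described above can be illustrated with a one-dimensional numerical sketch. It minimizes the expectation value of a discretized operator of the form (kinetic term plus potential) over normalized functions by inverse iteration. The harmonic potential, the factor 1/2 on the kinetic term, the grid, and all names are hypothetical choices for this illustration, not the atomic problem itself.

```python
import math

N = 401                      # number of grid points (illustrative)
L = 8.0                      # half-width of the box [-L, L]
H = 2.0 * L / (N - 1)
XS = [-L + H * i for i in range(N)]
V = [0.5 * x * x for x in XS]          # hypothetical confining potential

# Tridiagonal operator  -(1/2) d^2/dx^2 + V  with zero boundary values.
DIAG = [1.0 / (H * H) + v for v in V]
OFF = -0.5 / (H * H)                   # constant off-diagonal entry

def apply_operator(f):
    out = []
    for i in range(N):
        left = f[i - 1] if i > 0 else 0.0
        right = f[i + 1] if i < N - 1 else 0.0
        out.append(DIAG[i] * f[i] + OFF * (left + right))
    return out

def solve_tridiagonal(rhs):
    """Thomas algorithm for the symmetric tridiagonal system defined above."""
    c = [0.0] * N            # modified upper-diagonal coefficients
    d = [0.0] * N            # modified right-hand side
    c[0] = OFF / DIAG[0]
    d[0] = rhs[0] / DIAG[0]
    for i in range(1, N):
        denom = DIAG[i] - OFF * c[i - 1]
        c[i] = OFF / denom
        d[i] = (rhs[i] - OFF * d[i - 1]) / denom
    x = [0.0] * N
    x[-1] = d[-1]
    for i in range(N - 2, -1, -1):
        x[i] = d[i] - c[i] * x[i + 1]
    return x

def normalize(f):
    norm = math.sqrt(sum(v * v for v in f) * H)
    return [v / norm for v in f]

def expectation(f, g):
    return sum(a * b for a, b in zip(f, g)) * H

# Inverse iteration: repeatedly applying the inverse operator drives the
# starting guess toward the minimizer of the expectation value.
gamma = normalize([1.0] * N)
for _ in range(60):
    gamma = normalize(solve_tridiagonal(gamma))

energy = expectation(gamma, apply_operator(gamma))
potential_part = expectation(gamma, [v * g for v, g in zip(V, gamma)])
kinetic_part = energy - potential_part
```

In this toy problem the minimizer has a finite width: narrowing it further would lower `potential_part` but raise `kinetic_part` by more, which is the same competition invoked above for the electron and the nucleus.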

An amusing application of this principle is the case where $\gamma$ is a vector
representation, the normal to the surface of a soap-film. Minimizing
$\langle \gamma, \Delta \gamma \rangle$ subject to the constraint that the film adhere to a specified
1-dimensional wire frame gives the equation for the equilibrium configuration
of such surfaces. This tendency of minimizing $\langle \gamma, \Delta \gamma \rangle$ to function like
a surface tension can also be understood by recalling Maxwell's observation
about the Laplacian: the Laplacian of a function is proportional to the
difference between that function and that function's average value over a ball
of small radius [11], a property well-known in the numerical analysis of Laplace's
equation.
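
Maxwell's observation can be checked numerically in two dimensions, where the average of a smooth function over a small disk of radius r exceeds its central value by approximately (r^2 / 8) times the sum of its second derivatives. The test function, the point and the radius below are hypothetical, and the code uses the ordinary sum of second derivatives, so the sign is opposite to that of a positive-definite Laplacian.

```python
import math

def f(x, y):
    """Hypothetical smooth test function."""
    return math.exp(x) * math.cos(2.0 * y)

def laplacian_f(x, y):
    """Exact sum of second derivatives of f: f_xx + f_yy = f - 4 f = -3 f."""
    return -3.0 * f(x, y)

def disk_average_minus_center(x0, y0, radius, n_r=200, n_theta=200):
    """Average of f over the disk of given radius about (x0, y0), minus f(x0, y0).

    Midpoint quadrature in polar coordinates; each sample is weighted by its
    area element, which is proportional to rho.
    """
    center = f(x0, y0)
    weighted_sum = 0.0
    weight_total = 0.0
    for i in range(n_r):
        rho = radius * (i + 0.5) / n_r
        for j in range(n_theta):
            theta = 2.0 * math.pi * (j + 0.5) / n_theta
            value = f(x0 + rho * math.cos(theta), y0 + rho * math.sin(theta))
            weighted_sum += rho * (value - center)
            weight_total += rho
    return weighted_sum / weight_total

X0, Y0, R = 0.3, 0.2, 0.05
difference = disk_average_minus_center(X0, Y0, R)
prediction = R * R / 8.0 * laplacian_f(X0, Y0)   # two-dimensional proportionality factor
```

The two numbers `difference` and `prediction` agree to well under one percent for this radius; the same relation underlies the five-point averaging stencil used in relaxation methods for Laplace's equation.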

For nonEuclidean geometries, a more refined definition of the Laplacian
must be used to ensure that the property (5) is satisfied. The generalized
Laplacian of deRham, $\Delta = d\delta + \delta d$, where $d$ is the exterior derivative and
$\delta = *d*$, $*$ the Hodge dual operator, may be tediously shown to possess the
property (5), through the use of deRham's double forms [12]. This allows us to
write the most general form of this generalized Jaynes' principle, namely

$$ \delta_\gamma \{ \langle \gamma, \Delta \gamma \rangle \} = 0 \tag{16} $$

subject to constraints, among them $\langle \gamma, \gamma \rangle = 1$, where $\gamma$ is understood to
be any representation over a nonEuclidean space, and $\Delta$ is understood to be
deRham's Laplacian.

11 Maxwell, J.C., Treatise on Electricity and Magnetism, vol. 1, New York: Dover, 1991,
p. 31 

12 deRham, G., Varietes Differentiables, Paris: Hermann, 1960, p. 125 

5 APPLICATION TO NONEUCLIDEAN GEOMETRIES

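For a Riemannian geometry, deRham gave the explicit formula for his Laplacian
applied to a tensor of arbitrary rank $p$ [13]:

$$ (\Delta\alpha)_{k_1 k_2 \ldots k_p} = -\alpha_{k_1 k_2 \ldots k_p;i}{}^{;i} + \sum_{\nu=1}^{p} (-1)^{\nu} R^{h}{}_{k_\nu}\, \alpha_{h k_1 \ldots \hat{k}_\nu \ldots k_p} \tag{17} $$

$$ {} + 2 \sum_{\mu<\nu} (-1)^{\mu+\nu} R^{h}{}_{k_\nu}{}^{i}{}_{k_\mu}\, \alpha_{i h k_1 \ldots \hat{k}_\mu \ldots \hat{k}_\nu \ldots k_p} \tag{18} $$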

where $\hat{k}$ denotes that the subscript $k$ is dropped from the enumerated list of
indices, and where deRham's notation of the covariant derivative $\nabla_i$ is replaced
by the notation of Misner et al. [14], where the covariant derivative is denoted
by $;\alpha$, and where $R^{\alpha}{}_{\beta\mu\nu}$ is the Riemann curvature tensor. In spacetime,
applying deRham's Laplacian to the metric, $g_{\mu\nu}$, we see that the above formula
reduces to

$$ \Delta g_{\mu\nu} = -g_{\mu\nu;\alpha}{}^{;\alpha} + R_{\mu\nu} \tag{19} $$

where $R_{\mu\nu}$ is the Ricci tensor, the contraction of the Riemann tensor. The
term $g_{\mu\nu;\alpha}{}^{;\alpha}$ is zero, by the covariant constancy of the metric, and hence the
measure of correlation, the expectation value of the Laplacian, is

$$ \langle g_{\mu\nu}, \Delta g_{\mu\nu} \rangle = \int g^{\mu\nu} R_{\mu\nu}\, d\tau = \int R\, d\tau \tag{20} $$

where $d\tau$ is the volume element of integration over spacetime, and $R$ is the
scalar curvature, the contraction of the Ricci tensor. Minimizing $\int R\, d\tau$ over
all metrics is the Hilbert variational principle, and leads to Einstein's
equations of General Relativity in free space [15].
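
For reference, the standard variation behind that last statement can be sketched as follows. The sketch assumes that the volume element is $d\tau = \sqrt{-g}\,d^4x$ and that boundary terms vanish; both are assumptions of this illustration, stated in the conventions of Misner et al.

```latex
\begin{align*}
\delta \int R \, \sqrt{-g}\, d^4x
 &= \int \left( R_{\mu\nu} - \tfrac{1}{2}\, g_{\mu\nu} R \right)
    \delta g^{\mu\nu} \, \sqrt{-g}\, d^4x = 0
 \quad \text{for all } \delta g^{\mu\nu} \\
 &\Longrightarrow\quad R_{\mu\nu} - \tfrac{1}{2}\, g_{\mu\nu} R = 0
 \quad\Longrightarrow\quad R_{\mu\nu} = 0 \quad \text{(taking the trace)} .
\end{align*}
```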

Hence, the generalized Jaynes principle states that Einstein's equations 
of General Relativity in free space may be interpreted as asserting that the 
metric is the least-biased metric, or the minimally-correlated metric. 

13 deRham, ibid., p. 131

14 Misner, C.W., Thorne, K.S., and Wheeler, J.A., Gravitation, San Francisco:
W.H. Freeman, 1973

15 Misner et al., ibid., Ch. 21

In the nonEuclidean spaces of Cartan, the fundamental quantities are not
the metric, but rather the 1-forms of the repère mobile $\omega^\mu$ and the connection
1-forms $\omega^\mu{}_\nu$ [16]. He showed that, for these more general nonEuclidean spaces,
an additional invariant arises, the torsion; two such spaces are equivalent if
both the torsion and curvature are equal. Cartan's structure equations define
the torsion $\Omega^\mu$ and curvature $\Omega^\mu{}_\nu$:

$$ d\omega^\mu + \omega^\mu{}_\nu \wedge \omega^\nu = \Omega^\mu \tag{21} $$

$$ d\omega^\mu{}_\nu + \omega^\mu{}_\rho \wedge \omega^\rho{}_\nu = \Omega^\mu{}_\nu \tag{22} $$

For Riemannian spaces, the torsion is zero, and the problem of equivalence 
reduces to the study of the curvature form. While Einstein's equations of 
General Relativity allow one to write field equations for the curvature, the
issue of field equations for torsion and curvature has received less attention. 

The natural extension of the above ideas to such spaces with torsion is 
to consider the following equations: 

$$ \delta_\omega \{ \langle \omega^\mu, \Delta \omega^\mu \rangle \} = 0 \tag{23} $$

where the minimization is taken over all orthonormal bases $\omega^\mu$ and connection
forms $\omega^\mu{}_\nu$. Of the numerous possible forms to choose from, the basis 1-forms
seem the most natural, in that, like the metric, they represent a generalized
potential, derivatives of which lead to a generalized force; derivatives of
generalized forces can then be linked to source terms, like mass/energy density.
The details of the above equation will be explored in a future work. 

6 IMPLICATIONS OF THIS EQUATION 

An interesting question is "Are the principles of structure independent of 
scale?". One of the reasons the study of fractals in biological settings has 
generated such enthusiasm is that it implicitly answers that question "Yes"
over the scales ranging from the size of macromolecules to the size of plants. 

16 Slebodzinski, W., Exterior Forms and their Applications, Warsaw: Polish Scientific 
Publishers, 1970, section 124. This originated from Cartan, E., Ann. Ec. Norm., 40, 325
(1923), reprinted in his Oeuvres Completes, partie III, vol. 1, Paris: Gauthier-Villars, p.
659 

This equation appears to suggest that the principles of structure may be
independent of scale over an even wider range, from the atomic scale to the
astrophysical scale. Of course, at each scale the forces that are relevant (the
constraints that must be imposed in the generalized Jaynes principle) are
different, and hence the resulting structures are different.



